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Abstract 

Background: Recently developed techniques to study the spread of infectious diseases through net- 
works make assumptions that the initial proportion infected is infinitesimal and the population behavior 
is static throughout the epidemic. The models do not apply if the initial proportion is large (and fail 
whenever IZo < 1), and cannot measure the impact of an intervention. Methods: In this paper we adapt 
"edge-based compartmental models" to situations having finite-sized initial conditions. Results: The 
resulting models remain simple and accurately capture the effect of the initial conditions. It is possible 
to generalize the model to networks whose partnerships change in time. Conclusions: The resulting 
models can be applied to a range of important contexts. The models can be used to choose between 
different interventions that affect the disease or the population structure. 



Background 

The mathematical study of infectious disease spread has contributed significantly to our ability to design 
effective interventions to reduce disease spread. Most of the earliest models were based on the assumption 
that disease transmission occurs as a Poisson process and each transmission reaches an individual chosen 
randomly from the population. This implicitly assumes that partnership duration is very brief. These models 
have been modified to account for a number of different effects, such as demographic groups (lj. 

More recently, attempts have been made to incorporate the "network" structure of the population (see, 
e.g., [3]). Typically these focus on trying to understand the role played by "high-degree" individuals (those 
individuals with many contacts). Typically these studies come in one of two flavors: they either continue 
the assumption of fleeting partnerships (the disease spreads slowly compared to partnership turnover) [l] 
[8j|9j[T8|[2T] , or they take the opposite limit in which the partnership network is static (the disease spreads 
quickly compared to partnership turnover) [4{[5|[^ [Iu][Tl]|19||22] 

Recent work has shown that for susceptible-infectious-recovered (SIR) models, it is possible to unify 
these two approaches with an "edge-based compartmental model" that allows partnership duration to range 
continuously from zero to infinite [15] [for susceptible-infectious-susceptible (SIS) models, the picture is more 
complicated, see for example [6]]. The resulting models are low-dimensional and contain many standard 
models as special cases [16]. Unfortunately, these models are derived under the assumption that the initial 
proportion infected is infinitesimally small (while the absolute number infected is sufficiently large that the 
dynamics are deterministic). It is assumed that by the time the equations are used, any early transients 
have died away. A consequence of this assumption is that the models break down if 1Z < I or if the initial 
proportion infected is not negligible. 

The failure if the initial proportion infected is not negligible was observed by [7j. This paper used an 



early (static network) version of the equations of 15 from [22] and compared them with simulation. A small 
discrepancy in final sizes was noted. This discrepancy was not present for equations of [2] a system requiring 
O(M) equations where M is the maximum degree or for another system presented in [7] which required 
0(M 2 ) equations. 
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In this paper we will see that the discrepancy found by [7] results from the fact that the initial proportion 
infected was nonzero. We first show how to correct the equations to accommodate a non-negligible initial 
proportion infected. We then compare the model with simulations using different assumptions about how 
the initial infected individuals are chosen. Following that, we look at how the equations can be used to 
account for a change in population structure or disease parameters during the epidemic, such as might 
occur if an intervention is implemented. We then analyze the dynamical structure of the equilibria found 
in the equations, showing how the epidemic threshold is modified when a non-negligible proportion of the 
population is infected. Finally we discuss the assumptions underlying our approach and consequences when 
those assumptions fail. 



Methods 

We modify the approach of [15] which assumed an infinitesimal initial proportion infected. We adapt the 
approach to consider a wide range of possible initial conditions. We assume that the dynamics of the epidemic 
may be treated as deterministic, which means we assume the population is very large and the initial number 
infected is large enough for the epidemic to behave deterministically. If stochastic effects are still important 
but TZq > 1, then these equations may become accurate at a later time once sufficient numbers are infected. 

We assume the population consists of N 3> 1 individuals. Each is assigned a degree k independently 
of others, with probability P(k) where P defines a probability distribution on the (non-negative) integers. 
The network is wired together using the "Configuration Model" (or "Molloy-Reed" ) approach [T7j[l9j : each 
individual is assigned a number of stubs (or half-edges) equal to its degree. Pairs of stubs are then wired 
together to form edges/partnerships. It is likely that this algorithm produces a handful of self- loops or 
repeated edges, but generally the frequency of these goes to zero like 1/N. 

We define a test individual it to be a randomly chosen individual. Because we assume that the spread is 
deterministic, this means that the probability u is in a given state is equal to the proportion of the population 
that is in that state. So we focus on calculating the probability u is susceptible, infected, or recovered. We 
modify u so that it does not transmit to any of its partners if ever infected. This assumption does not 
affect the probability u is in any given state, but it does prevent a correlation between the status of different 
partners which would be caused by infection traveling through u. This allows us to treat the partners of u 
as independent. It is important to note that this assumption has no impact on the probability u is in any 
given state and therefore, it does not affect our calculation of the proportion of the population in each state. 



Further discussion of the test individual is in 14 



Variables 

We introduce our variables in table [I] The starting point is the test individual. The remaining variables can 
be broadly divided into three groups. S, I, and R denote the proportion of the population in each state, or 
equivalently the probability that the test individual u is in each state. 9, (f>s, 4>i, and 4>r give information 
about the probability a partner of u has a given status and the probability the partner has transmitted to 
u. P gives information about the possible degrees of u or its partners, while S(k, 0) gives information about 
the probability u is initially susceptible. Given P, and S(k, 0), we define i[>(x) = ^ S(k, 0)P(k)x k , so that 
ip(8(t)) is the probability u is susceptible. By noting that ip(8(t)) = S(t), we will be able to close our system 
of equations. 

The main distinction between this approach and the previous approach |15| is that we use just the initially 
susceptible individuals to define ip while the earlier work assumed S(k, 0) = 1 for all k. In this case, ip is a 
probability generating function for the degree distribution. 



Equation Derivation 

We will find a closed system of equations based on these variables. We begin by looking at S(t). If the test 
individual u has degree k and is susceptible at t = 0, then the probability it is susceptible at some later time 
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Variable 


Definition 


Test Individual u 


A randomly chosen member of the population who is prevented 
from causing infection. 


S(t) 


The proportion of the entire population that is susceptible. 


I(t) 


The proportion of the entire population that is infected. 


R(t) 


The proportion of the entire population that is recovered. 


9(t) 


The probability a random partner v of u which did not transmit 
to u by t = has not transmitted to u by time t. 


M*) 


The probability a random partner v which did not transmit to u 
by t = is susceptible at time t. 


Mt) 


The probability a random partner v which did not transmit to u 
by t = is infected at time t but has not transmitted to u. 


Mt) 


The probability a random partner v which did not transmit to u 
by t = is recovered at time t and never transmitted to u. 


p(k) 


The probability an individual has degree k. 




The average degree. 


S(k,0) 


The probability an individual with degree k is initially 
susceptible. 




The probability that the test individual u is susceptible at time 
t. In a large population this should equal S(t). 



Table 1: The variables we need to calculate the epidemic dynamics. In all of these u is a test individual: randomly 
chosen from the population and modified so that it cannot infect others, although it can become infected. 
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Figure 1: Flow diagram showing the flux of individuals between the different compartments. Because we have an 
explicit expression for S, if we know 9 we do not need to explicitly determine the flux from S to I. 

is 9(t) k . If we do not know k or whether u is susceptible at t = 0, then the probability u is susceptible at 
time t is sum over all k of the product of the probability u is initially susceptible S(k, 0) with the probability 
u is still susceptible 9 k . We have S(t) = J2 k S(k,0)P(k)9(t) k = ip(9(t)). Thus we conclude 

S(t) = i>(d(t)) 

Given our initial conditions on / and R, we know that R solves R = jl. We also have a conservation rule 
that S + 1 + R = 1, so 1=1 — S — R. Thus our equations are 

s = m 

I=l-S-R 
R = -yI 

Assuming 9(t) is known, then this system completely defines S, I, and R. This is shown in the flow diagram 
in figure [T] 

In order to close this system of equations we need an equation giving 9. Recall that 9(t) is the probability 
a partner v of u which had not yet transmitted to u by time has not transmitted by time t. This can 
be broken in three disjoint sub-compartments. If v has not yet transmitted to u by time t it is either 
susceptible, infected, or recovered. Thus, setting (f>s to be the probability that v is still susceptible, 4>j to be 
the probability v is infected but has not transmitted to u, and 4>R to be the probability v has recovered and 
did not transmit to u, we conclude that 6 = <fis + 4>i + 4>r. It is straightforward to see that 9 = —ft(f>i- So 
if we can find </>/ in terms of 9, then we arrive at a single equation for 9, which can be used to provide 9 for 
the S, I, and R equations. 

To do this, we use the fact that 4>i = 9 — 4>s — 4>r and find (f>s and 4>r in terms of 9. We turn to figure [2] 
The recovery rate is 7 and the transmission rate is j3, so we have <j>R = — J0//3. We can integrate this, and 
using the fact that 9(0) — 1 we find 4>r — 7(1 — 6)/j3 + <Pr(0). To find 4>s m terms of 9, we note that 
the probability u has an edge to a node which is susceptible at time t = is 05(0). The probability the 
susceptible partner has degree k is kP(k)S(k, 0)/^ kP(k)S(k, 0), so the probability an initially susceptible 
partner is susceptible at some later time is J2 k kP(k)S(k, 0)9 k ' 1 / J2 k kP{k)S(k, 0) = ip' (9) / ip' (1) . Thus 
<p s (t) = s (O)^'(0(i))/y(l). We arrive at 

<Pi = o-Mo)^-j(i-9)-Mo) 

and 9 = —fi<pi becomes 

9 = -(39 + f3^ s (0)^± + 7 (1 - 9) + ^ R (0) 
ip '(1) 

with 9(0) = L This completes our system. 
Our final closed system of equations is 

= _00 + ^ s (O)ffl+ 7 (l-0)+ J 80 Jl (O) (1) 
ip'(l) 

R = ~/I, S = ip(6), I = l-S-R (2) 

where 9(0) = 1, and R(0) is given by the initial conditions. These equations lead to earlier equations 



of li™ if K > 1 and 1 - (f> s (0), 0/(0), cf> R (0) and 1 - 9(0) are all infinitesimally small. 
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Figure 2: Flow diagram for the flux of partners through different states. The top three boxes <j)s, and 4>R 
represent the different states the partner can be in if it has not transmitted. The lower box 1 — 8 is the probability 
the partner has transmitted. 



Generalizing the model 



In 15 , a number of generalizations to various dynamic network structures were considered. The basic 



approach we have used here can be applied to any of those generalizations. 



Final size relation 



The final size relation assuming small initial condition is well-known 19 . The final size relation for larger 



initial conditions has recently been found |14| in a more general case not assuming constant transmission 
and recovery rates. It can be derived easily for this model by setting 6> = 0. We find 



P 



+ 7 



/3 + 7 



R(oo) = 1 - V(6»(oo)) 



(3) 
(4) 



Results and Discussion 
Model Validation 

In this section we compare our model with simulations for populations which satisfy the Configuration 
Modcl/Molloy-Reed model assumptions. Although an earlier version of our equations was found to have 
minor discrepancies we show that once we appropriately account for the initial condition, the calculation 
becomes correct. 



Final Size Comparison 

To show that our new equations accurately calculate the impact of the initial conditions, we first consider 
epidemic spread in networks with the same degree distribution as in ]Y , but with varying numbers infected 
and varying population sizes. We then consider the impact of selecting high or low degree nodes as the 
earliest infected individuals, using networks whose degree distributions more clearly show the impact of 
biased selection of the initial individuals. 

We run a large number of simulations for each number of initial infections. For each simulation we 
generate a new network. Our simulation technique is similar to those recently described by [2[|4j[20j. In the 
Configuration Model framework, each node is assigned a degree, nodes are given stubs (or half-edges), and 
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Figure 3: Results of simulations for 100, 1000, and 10000 individuals. The solid curve gives our prediction for the 
final sizes of epidemic in a large population. Colors are log scale giving frequency of that particular epidemic size. 
In the case of 100 individuals we take every possible initial condition from 1 to 100 infections and perform 50000 
simulations. In the case of 1000 individuals, we take every possible initial condition from 1 to 1000 infections and 
perform 11500 simulations. In the case of 10000 individuals, however, we only look at multiples of 10 initial infections, 
performing 3000 simulations at each level. To show that the number of simulations performed accurately capture the 
full range, we show an increased number of simulations from 17.5% to 22.5%, performing 2000000, 50000, and 10000 
simulations. It is clear this increased number of simulations has no significant impact. 

then stubs are randomly paired together. In the simulations we use, each node is assigned a degree, nodes 
are given stubs, and then the disease begins to spread in the network before stubs are paired. Each time 
the disease transmits along a stub that stub is randomly paired with another as yet unpaired stub. If the 
partner is susceptible, then it becomes infected. If not, nothing happens. Once stubs are paired they remain 
in their edge. This approach is equivalent to constructing the network in advance and then following the 
disease, but it is more efficient because it only constructs those parts of the network the disease traces. 

Randomly selected initial infections We first consider varying numbers of randomly chosen infected 
individuals. In figure[3]we take the degree distribution from [jj. We have P(l) = 18.118 x 10~ 3 , P(2) = 
72.536 x 1(T 3 , P(3) = 145.222 x 1CT 3 , P(4) = 194.589 x 1CT 3 , P(5) = 195.962 x lfr 3 , P(6) = 156.857 x 
KT 3 , P(7) = 105.280 x 10~ 3 , P(8) = 59.713 x 10~ 3 , P(9) = 30.066 x 10~ 3 , and P(10) = 21.657 x 10~ 3 . 

We randomly select a proportion p of the population to initially infect. We have S(k,0) = 1 — p for all 
k, so tp(x) = (1 — p) J2k P(k)x k . Similarly we have </>s(0) = 1 — p. Because the epidemic begins with no 
recovered individuals, we take <f>n(0) — 0. We take j3 = 0.1 and 7 = 0.2 

We take populations of 100, 1000, and 10000 and perform many simulations. To compare with our 
predictions, we consider the final sizes observed, using the final size relation of equations ^ and Q to 
compare with simulations. The equations are derived in the infinite population limit, but we see that even 
with populations of only 100 they give a good prediction of the observed behavior. As the population size 
increases, the noise becomes less significant and the simulations collapse tighter around the prediction. 

Biased initial infections To show that the approach we have derived can also be applied to cases where 
the initial infected individuals are selectively chosen based on their degree, we use a different degree dis- 
tribution which helps highlight the effect. We take P(l) = P(9) = 1/2. We consider two options. In the 
first approach, individuals with higher degree are preferentially selected. To do the selection, we choose an 
individual with probability proportional to the square if its degree, and infect it. We repeat this, until a 
proportion p of the population is infected. In the second approach individuals are chosen with probability 
proportional to the square of their inverse degree until a proportion p is infected. We take j3 = 0.1 and 
7 = 0.6. 

Using these rules, we clearly see that S(k, 0) is not uniform. Instead, for the case where individuals are 
selected with probability proportional to their squared degree, we find that S(k,0) = a k where a solves 
X; fc P(fc)a fe2 =1-P- We find <f> s (0) = £ kS(k, 0)P(/c)a fe2 / J^k kP{k). In the case where individuals are 
selected with probability inversely proportional to their squared degree, we find that S(k, 0) = a x / k where 
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Figure 4: Epidemic final sizes in population of 1000 individuals with half having degree 9 and half with degree 1. 
The disease parameters are ft = 0.1, j = 0.6. Results of simulations having initial infections chosen with probability 
proportional to square of degree (left) or inverse square of degree (right). For each initial number of infections, 22500 
simulations were performed. For the range between 17.5% and 22.5%, 200000 simulations were performed to give 
insight into how well resolved the distribution is. 

a solves £ fc -P(fc)a 1/fe2 = 1 - p, and S (O) = £ kS(k, 0)P(k)a 1 / k2 / kP(k). 

We compare predictions and simulations in populations of 1000 individuals in figure |4j In the limit of 
a negligible initial proportion infected, the final size of epidemics in these networks is about 4%. As we 
increase the number of initially infected individuals, we increase the final size because of these individuals 
and because of the additional infections they lead to. At small amounts, increasing the number of high degree 
nodes has a much larger impact on the final size because they cause more additional infections. However, 
as the amount of infection initially present is increased this effect becomes less important: the high degree 
individuals would become infected anyway. So the largest gain in final size comes from infecting low degree 
individuals who would not receive an infection from their partners. The "kinks" that occur just above 50% 
initially infected are because effectively all individuals of high (left) or low (right) degree are initially infected. 

Dynamic Calculation 

We now look at the performance of the dynamic equations. The dynamic prediction is more easily affected by 
noise than the final size prediction, so we use larger population sizes. We again take the degree distribution 
of [7]. We begin with 5% infected, either randomly chosen, or chosen as before proportional to the square of 
the degree. A comparison of simulation with calculations is in figure [5j The theory accurately predicts the 
dynamics of epidemics. 

Intervention Impact 

We can use our equations to compare the impact of several interventions. We consider an epidemic spreading 
in the population, and at some intermediate time we introduce a change in the disease or population. Because 
the system changes at a time with a non-negligible amount of infection in the population, the equations 
derived assuming a negligible proportion infected fail. 

Consider a population in which P(4) = P(5) = P(6) = 1/3. Assume we initially infect a small, randomly 
chosen proportion of the population, p at t = 0. Thus we have tp(x) = (1 — p)(x 4 + x 5 + x 6 ) /3, (f>s{0) = 1 — p, 
0/(0) = p, and cf> R (0) = 0. We take /? = 1 and 7 = 1/2. 

We consider three interventions which may be introduced at time t±. All are aimed at "halving" the 
transmission rate, but they do this in different ways. In mass-action based models, these would all have the 
same effect. We can clearly identify differences using our approach. 

1. An intervention that reduces (3 by a factor of 2. 
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Figure 5: A comparison of the observed and predicted number of infections from simulations. Left: 5% initially 
infected, chosen randomly from the population. Right: 5% initially infected, chosen with probability proportional to 
squared degree. 

2. An intervention that reduces /3 so that per-contact transmission probability (3/(/3 + 7) is reduced by a 
factor of 2. 

3. An intervention that eliminates half of the partnerships randomly. 

The distinction between the first two comes from the fact that partnerships have duration. The expected 
number of transmissions an individual sends to a partner is /3/7, but only the first is successful. If we use 
mass action assumptions however, each transmission is to a replacement partner, and so halving /3 halves the 
total number of transmissions. When we account for infinite partnership duration, the expected number of 
transmissions remains the same, but some partnerships transmit more than average, so others must transmit 
fewer. The probability of transmitting at least once is /?/(/? + 7). So to reduce infection probability by a 
factor requires a larger reduction to j3. Note that the work of [l2j suggests that in Configuration Model 
networks the final size of our second and third intervention will be the same, (but that in clustered networks 
it will be different). 

We will demonstrate our approach in all three cases, restarting the calculations when the intervention 
is put into place. In all cases, this allows us to use the conditions at h to predict the final size. We take 
ipo(%) — 1 P{ x )i ^5,0: 0j,Oj an d 4>r,o to correspond to time less than t\. We use a subscript of 1 for times 
after h- We solve the original equations, and then use the results to initialize the second set of variables. 

Case 1 We begin by reducing j3 by a factor of 2 at time t\. Until time t\, we are solving the original 
equations. By solving the original system until h we have 60(h)- The probability an individual of degree k 
is susceptible at time t\ is S(k,ti) = S(k, 0)9 (ti) k . So our new ip(x) is tpi(x) = ^ k P(k)S(k,Q)6 (t{) k x k = 
ip(6o(h)x)- We take our new B\ to have 61(h) = 1. The intervention we are doing has no impact on the 
probability a partner is in any given state. cf>g, (f>j, and 4>r keep the same proportion, but are scaled up to 
sum to 1 so each is scaled by 60(h). For example, <ps,i(h) = <t>Sfi(h)/0 (h) = <f>s,o(0)%(6o(h)) /%(^o(h)- 
We restart the solutions with these new values. 

Case 2 The total probability of transmitting to a partner is (3/(f3 + 7). For this intervention we change j3 
so that P/(/3 + 7) is reduced by a factor of 2 at time h- This proceeds exactly as above except that the new 
value of j3 must be smaller. 

Case 3 When we delete half the edges at random, we do not affect the probability that a random partner 
is in any given state. So the <f> variables rescale in the same way as for changing j3 in the previous cases. 
However, ip undergoes a more significant change. As a starting point, consider Pi(k\), the probability an 
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Figure 6: The impact of interventions. Epidemics begin at t = with 0.001 of the population infected. Left: 
epidemic curve without interventions, and with each intervention introduced at time t\ = 1.5. Right: horizontal axis 
is ti, showing final effectiveness if interventions introduced at different times. 



individual has degree k\ after edges are deleted. This depends on Po{ko), the probability of having fco edges 
prior to deletion. The relation is 

»)=E»)C)(i) k 

fco 

The probability the individual has degree k\ and is susceptible is 

So if we restart the calculations at t = t\ we have S(ki,ti) = Q{k\)/P\(k\) and 

iP x {x)=Y d S{k u t l )Pi{kx)x k i 
fci 

fci 

fci fco 

=£p„(m*(^(^eG:) (£f~ fcI (f) fei 

fco fei 

= EPo(fco)5(fc o ,O)0o(ti)' £( 
= Vk> f fio(*i) 

(in general if we delete edges with probability p and keep with probability q = 1 — p, then the new function 
is ip\{x) = ipo([9o(ti)]\p + qx])). Using this new tpi(x), the same system of equations holds. 

Figure[6]compares these strategies. As anticipated, the final sizes resulting from cases 2 and 3 are identical, 
regardless of the time of intervention. However, we see that the dynamics are significantly different. 
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Figure 7: Bifurcation diagram with 0r(O) = 0, 4>i(Q) = 1 — 0s(O), and 0s(O) as given. Disease parameters are 
ft — 1 and 7 = 1.5. In each all members of the population have degree either 3 or 4, with the proportions chosen 
so that 0"(1)/-0'(1) takes the values on the horizontal axis. Approximate curves come from equation |5|. Only the 
equilibria with fj, < are physically meaningful. 



Bifurcation analysis 

We try to gain a better understanding of the epidemic transition and what happens to the final size as the 
initial proportion infected is increased. Consider the final size relation found from 
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If 0s (0) + 0/j(O) = 1, then we find that 9 = 1 is a solution to these equations. Physically this states that 
if there is no infection initially [0/(0) — 0] there will be no infection later. To study what happens when 
0/(0) > (but possibly arbitrarily small), we begin by first analyzing the structure of the dynamical equation 
for 9 under the assumption that 0s(O) = 1 and 0,r(O) = 0, taking 8 < 1. These assumptions contradict our 
initial conditions, but understanding this system first will lead to an easier understanding of the full system 
with 0/(0) > 0. 

If 05(0) = 1 and 0/j(O) = 0, the equation for 9 becomes 



which has 



whenever 



/V(i) +7 



/3 + 7 

Clearly 9 = 1 is an equilibrium. Close to 9 = 1, we write = 1 + (i, so -0'(6') /■)/'' (1) = 1 - 
H 2 ijj'"(l)/2ip'(l) + 0(fi 3 ). Substituting this into the equation for the equilibrium we have 



/^"(i)/y>'(i) 



1 + /x = 1 + fi/3 



V>"(l)+^"'(l)/2 + 0(M 2 ) 



which yields 



fi = 



/' : 



^'(1)08+7) 



^'(1)08 + 7) 



(5) 



V>"'(i) 

So there is a bifurcation as the bracketed term passes through zero, when (f3 + j)/f3 = ip" (1) / ijj 1 (1) . This 
is the well-known epidemic threshold 19 . The bifurcation is transcritical and corresponds to TZq increasing 
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through 1. If ip" (1) / ip' (1) < (/3 + 7)//3 (that is TZq < 1) then /i is positive and the corresponding equilibrium 
has > 1 and is unstable, while the equilibrium at = 1 is stable. In our case, we will not observe 9 > 1 
because 8 is a probability. If however if)" (1) / tp' (1) > (/? + j)/j3 (that is TZq > 1), then the corresponding 
equilibrium has 9 < 1 and is stable while the equilibrium at 9 = 1 is unstable. 

We are now able to consider the effect of realistic initial conditions. We keep 4>r(0) = 0, but take <fii(0) 
to be a small positive number with 9(0) = 1 and <j>s(0) = 1 — 4>i(0). The bifurcation diagram changes 
slightly. Compared to the equations assuming </>s(0) = 1, this has the effect of decreasing 9 slightly, so the 
equilibrium values shift. 

Below the bifurcation, the equilibrium 9 = 1 is slightly reduced to a 6q < 1, but remains stable. The 
solution with initial condition 9 = 1 converges to this equilibrium. Above the bifurcation, the 0=1 
equilibrium is slightly increased to a 0q > 1 and is unstable. The other equilibrium with smaller 9 is stable 
and its location is decreased slightly. Thus the solution with initial condition = 1 decreases and converges 
to the stable solution. 

This bifurcation diagram helps explain some of the apparent discrepancies of the earlier models. When 
TZq < 1 and 9 < 1, the earlier models suggested that > 0, that is, the probability a neighbor has 
transmitted reduces in time. This results from the fact that the model assumed a negligible initial proportion 
infected, and so it could not capture the fact that the stable equilibrium at 8 = 1 is reduced slightly by 
the initial condition. In these equations, the system converges to an equilibrium that is in the wrong place. 
Similarly, when TZq > 1, but the initial proportion infected is not negligible, the system again converges to 
an equilibrium in the wrong position. This is what happened in the [7] paper which started with a very 
small initial condition, and observed a very small discrepancy in the final size. 



Conditions leading to failure of model 

There are some assumptions implicit in our derivation which deserve further attention. The model fails if 
0(0), (ps(0), 0/(0), or 4>r(0) depend on degree of u. So if for example, we select high degree individuals and 
then infect their partners (leaving the high degree individuals uninfected), the model will not account for 
the fact that higher degree individuals are more likely to have infected partners at t = 0. The approach will 
fail. 

It does not fail if the initial individuals infected have higher (or lower) degree. This simply affects 
the initial conditions. Indeed, we expect that if the infection is initially spreading stochastically in the 
population, and we set t = to be when enough cases are infected to have deterministic behavior, we will 
see that at t = a disproportionate number of higher degree individuals have been infected. This does not 
present a challenge. 



Discussion 

We have shown that recent techniques used to derive epidemic dynamics in networks may be adapted to 
situations in which the initial condition is not small. The resulting equations are relatively simple to solve 
numerically. We have shown how these equations can be used to derive a final size relation. Our results 
correct an apparent discrepancy seen in earlier work comparing equations of |22| with simulations of |7j. 

One of the most obvious applications of these results is to the understanding of the impact of an in- 
tervention which begins after a disease has established itself. This has been a weakness of network models 
for some time: the earliest models could only calculate static quantities such as the final size of epidemics 



assuming no intervention, while more recent approaches that calculate the dynamics 13 15 22 have been 
restricted to the assumption of asymptotically small initial conditions, again with no change in the popula- 
tion. Because we now have a model which can account for large initial conditions, we can use this to restart 
our calculations when an intervention is to be implemented, or we can use the final size relation to quickly 
compare intervention effectiveness. 

We have analyzed the bifurcation structure of the final size relation, and used this to explain an apparent 
discrepancy in earlier work if TZq < 1. The previous models that assumed small initial condition also 
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implicitly assume that TZq > 1. This resulted in a disturbing prediction for TZq < 1 that transmissions could 
be reversed as time progresses, and infected individuals are uninfected. Once we correctly account for the 
initial condition this apparent discrepancy disappears. 

If variance is large enough, then there may be a small number of very high degree individuals who have a 
macroscopic effect on the dynamics. If we increase population size to "drown out" their signal, we expect to 
have a small number of much higher degree individuals who again have a macroscopic effect on the dynamics. 
Deterministic predictions will not be accurate: for example, how long the highest degree individual remains 
infected will influence the final size. The work of [4] rigorously studied the equations using small initial 
conditions, and showed that if all moments up to the fifth moment were finite, then these equations are 
accurate in the limit of a large network. Whether all these moments are necessary is unclear (the equations 
are well-behaved so long as the second moment is finite). Regardless, when the equations do not work, it 
is due to high degree individuals, so if the high degree individuals are removed from the population, the 
equations will work. 

After the epidemic has run for a short period of time, all of these high degree individuals have been 
infected and recovered. The remaining population will have significantly reduced moments. At this point, 
the stochastic effects are "frozen in" : the dynamics are now deterministic. We can use these conditions to 
initialize our new system of equations. 

Conclusions 

Recent advances in our understanding of infectious disease spread in networks have allowed us to accurately 
predict SIR disease spread in a range of networks, under the assumption of a negligibly small initial condition. 
However, in many contexts, such as might occur when an intervention is applied, the small initial condition 
assumption is false and the models give inaccurate predictions. The method we have introduced allows us 
to modify the previous equations and eliminate the assumption that the initial condition is small. 

Our modeling approach accurately predicts the size and dynamics of simulated epidemics with arbitrary 
sized initial conditions. The approach allows us to compare interventions introduced during the epidemic, 
which is not possible with previous network-based approaches. 

Our system of equations ([lj and ([2| are mathematically simple and can be solved numerically with 
standard tools. Changes in the population's degree distribution do not alter the structure of the equations, 
and in particular, the population can have arbitarily large maximum degree without requiring any increase 
in the number of equations. 
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